Add test for selecting a single complex field array and its parent struct array [databricks] #8744
Signed-off-by: Raza Jafri <[email protected]>
It would be good for the description in this PR (if not also the issue #8712) to be more descriptive of the change. The test here is a sampling of tests covered in You should consider combining these tests into a single change. All that's changing between these tests is the output projection.
Adding to this for clarity:
build |
build |
connection failure in premerge |
integration_tests/src/main/python/prune_partition_column_test.py
sql-plugin/src/main/scala/org/apache/spark/sql/rapids/ExecutionPlanCaptureCallback.scala
build |
Premerge is failing with the following exception, which seems to be unrelated to this change. I also see another PR failing with the same exception.
build |
build |
@razajafri try to upmerge and then run the premerge tests again. |
build |
I am looking into the CI failure. From the logs, the test that I added is possibly crashing the JVM which is causing It could just be a coincidence that the connection dropped so I will try to kick this off one more time. |
build |
build |
build |
build |
build |
build |
@mythrocks @revans2 can you please take another look |
…arent struct array [databricks] (NVIDIA#8744)" This reverts commit 1fb5fc4. Signed-off-by: Jason Lowe <[email protected]>
* Revert "Create an anonymous subclass of AdaptiveSparkPlanHelper in ExecutionPlanCaptureCallback.scala [databricks] (#8977)"
  This reverts commit bcc929c.
  Signed-off-by: Jason Lowe <[email protected]>
* Revert "Add test for selecting a single complex field array and its parent struct array [databricks] (#8744)"
  This reverts commit 1fb5fc4.
  Signed-off-by: Jason Lowe <[email protected]>
---------
Signed-off-by: Jason Lowe <[email protected]>
This PR tries to mimic the test in Spark's SchemaPruningSuite.
We create a table with complex fields and read only a subfield to check that column pruning works the way it's supposed to, i.e. that we are not reading unnecessary columns. For example, if we have a complex type contact with an array of friends, selecting
spark.table("contacts").select(explode("friends").alias("friend")).select("friend.first_name")
shouldn't also read the middle_name and last_name of the friends field.
fixes #8712
fixes #8713
fixes #8714
fixes #8715
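To illustrate what "not reading unnecessary columns" means for the friends example above, here is a minimal pure-Python sketch of nested schema pruning. It is a toy model only, not Spark's or the plugin's implementation: the dict/list schema encoding, the `prune_schema` helper, and the `contacts_schema` fields other than the three name fields are assumptions made for illustration.

```python
# Toy model of nested schema pruning. NOT Spark's implementation: the schema
# encoding and the helper name prune_schema are hypothetical.
# A struct is a dict of field name -> type, an array is a one-element list
# holding its element type, and a leaf is a type-name string.

def prune_schema(schema, paths):
    """Keep only the parts of a nested schema reached by dotted paths."""
    # Group the requested paths by their first component.
    wanted = {}
    for path in paths:
        head, _, rest = path.partition(".")
        wanted.setdefault(head, []).append(rest)

    pruned = {}
    for name, field_type in schema.items():
        if name not in wanted:
            continue  # never referenced, so it does not need to be read
        rests = wanted[name]
        is_array = isinstance(field_type, list)
        element = field_type[0] if is_array else field_type
        if "" in rests or not isinstance(element, dict):
            # The whole field was requested (or it has no subfields to prune).
            pruned[name] = field_type
        else:
            # Only some subfields were requested: recurse into the struct.
            sub = prune_schema(element, rests)
            pruned[name] = [sub] if is_array else sub
    return pruned


# Assumed layout of the contacts table, for illustration only.
contacts_schema = {
    "id": "int",
    "friends": [
        {"first_name": "string", "middle_name": "string", "last_name": "string"}
    ],
}

# Selecting only friend.first_name needs just one subfield of friends.
read_schema = prune_schema(contacts_schema, ["friends.first_name"])
# read_schema == {"friends": [{"first_name": "string"}]}
```

In this model, selecting both the subfield and its parent array (paths `"friends.first_name"` and `"friends"`), as in the PR title, keeps the full `friends` struct, since the parent is needed whole.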